Distributed n-player approachability via time and space average consensus

نویسندگان

Dario Bauso

Giuseppe Notarstefano

چکیده

In this paper we consider repeated coalitional games with transferable utilities (TU) over networks. Namely, we consider a set of n players that have to distribute among themselves a vector of rewards (one for each player). In our network version there is no coordinator allocating the rewards, but the agents have to agree on a common time-averaged vector by updating the local estimates of the reward vector. The common time-averaged reward vector has to approach a suitable constraint set, called core of the game, that guarantees that no agents benefit from quitting the grand coalition. We propose a doubly (over time and space) averaging distributed algorithm. At every iteration, each agent first computes a weighted average of its own timeaveraged estimate and those of his neighbors and then generates a new reward vector in order to drive the time-averaged estimate towards a pre-assigned set. The main contribution of the paper is to prove that under certain assumptions, i) all agents’ estimates reach consensus on the true time-averaged reward vector, and ii) the estimates (and thus the true time-averaged reward vector) approach the pre-assigned set. Conditions for this to happen are related to the connectivity over time of the communication topology and to the approachability principle.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Approachability in repeated games: Computational aspects and a Stackelberg variant

We consider a finite two-player zero-sum game with vector-valued rewards. We study the question of whether a given polyhedral set D is “approachable,” that is, whether Player 1 (the “decision maker”) can guarantee that the long-term average reward belongs to D, for any strategy of Player 2 (the “adversary”). We examine Blackwell’s necessary and sufficient conditions for approachability, and sho...

متن کامل

Approachability in Repeated Games: Computational Aspects and a Stackelberg Variant1

متن کامل

Approachability in Population Games∗

This paper reframes approachability theory within the context of population games. Thus, whilst a player still aims at driving her average payoff to a predefined set, her opponent is no longer malevolent but rather extracted randomly at each instant of time from a population of individuals choosing actions in a similar manner. First, convergence conditions are revisited based on the common prio...

متن کامل

Approachability, fast and slow

Approachability has become a central tool in the analysis of repeated games and online learning. A player plays a repeated vector-valued game against Nature and her objective is to have her long-term average reward inside some target set. The celebrated results of Blackwell provide a 1/ √ n convergence rate of the expected point-to-set distance if this is achievable, i.e., if the set is approac...

متن کامل

Zero - Sum Games with Vector - Valued Payoffs

In this lecture we formulate and prove the celebrated approachability theorem of Blackwell, which extends von Neumann's minimax theorem to zero-sum games with vector-valued payoffs [1]. (The proof here is based on the presentation in [2]; a similar presentation was given by Foster and Vohra [3].) This theorem is powerful in its own right, but also has significant implications for regret minimiz...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره شماره

صفحات -

تاریخ انتشار 2012

Distributed n-player approachability via time and space average consensus

نویسندگان

چکیده

منابع مشابه

Approachability in repeated games: Computational aspects and a Stackelberg variant

Approachability in Repeated Games: Computational Aspects and a Stackelberg Variant1

Approachability in Population Games∗

Approachability, fast and slow

Zero - Sum Games with Vector - Valued Payoffs

عنوان ژورنال:

اشتراک گذاری